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Abstract —We view hyper-graphs as incidence 
graphs, i.e. bipartite graphs with a set of nodes 
representing vertices and a set of nodes representing 
hyper-edges, with two nodes being adjacent if the 
corresponding vertex belongs to the corresponding 
hyper-edge. It defines a random hyper-multigraph 
specified by two distributions, one for the degrees of 
the vertices, and one for the sizes of the hyper-edges. 
We develop the logical analysis of this framework and 
first prove a convergence law for first-order logic, then 
characterise the limit first-order theories defined by a 
wide class of degree distributions. Convergence laws of 
other models follow, and in particular for the classical 
Erdos-Renyi graphs and /.-uniform hyper-graphs. 

I. Introduction 

Understanding the graphs that occur in nature is an 
important challenge in computer science, as many phe¬ 
nomena can be seen as diffusion processes on graphs, such 
as the diffusion of ideas in society, from marketing to social 
uproar, or the spreading of epidemics [5]. Most natural 
graphs have a sparse structure yet are locally highly 
clustered, a property that classical random graphs fail to 
model. This motivated the introduction of a myriad of new 
models with more realistic features, with an overview of 
rigorous results in [8]. 

This paper considers the bipartite configuration model 
[2], interpreted as the incidence graph of a hyper¬ 
multigraph by considering one set of vertices to be hyper¬ 
edges, and the binary relation to specify that a vertex 
belongs to a hyper-edge. A random hyper-graph is speci¬ 
fied by two given degree distributions, one for the degrees 
of the vertices and one for the sizes of the hyper-edges 
(communities), and generates the uniform incidence graph 
on these degree distributions. The analogy with the real- 
world formation of graphs is compelling, since vertices 
are often linked by some shared community, actors may 
be linked because they play in the same film, scientists 
because they collaborate on a paper m , nsi- In society, 
communities can be of any size, involving just two vertices 
(friendships), a small number of vertices (families), up to 
very large communities (professions, urban areas). Hence 
this is a flexible model that generates a wide class of graphs 
by allowing a wide range of limit distributions, generates 
graphs with a sparse structure and dense communities, 
and models complex relations (see [TO] for an application 
of hyper-graph as abstractions of social networks). 


The logical analysis is one way to understand a random 
structure, by studying the probabilities that first-order 
formulas are satisfied. It has been started on random 
graphs independently in m and [9], a result celebrated 
as the zero-one law of random graphs. The whole of the 
Erdos-Renyi model was successfully charted , and [2T| 
extended results to k-uniform hyper-graphs with a differ¬ 
ent logical formalism, using a k -ary predicate. The logical 
structure traditionally used allows only to treat simple 
graphs; following [6], it is shown that using the incidence 
graph as logical structure allows to handle non-uniform 
hyper-multigraphs without modifying the logic, using just 
a binary predicate. This allows to study directly the logic 
of random hyper-multigraphs, and since this framework 
extends conservatively the classical one it allows to recover 
previous results on classical Erdos-Renyi graphs and k- 
uniform hyper-graphs as special cases. 

The main result is a convergence law for a class of ran¬ 
dom non-uniform hyper-multigraphs (Theorem [II). Besides 
natural conditions to avoid pathological sequences, the 
sequences in this class must have limit distributions with a 
bounded second moment and a sublinear fourth moment. 
This includes the case of Dirac distributions that generate 
hyper-graphs with uniform hyper-edge sizes. It extends a 
result of Lynch m that gave a convergence law for a class 
of graphs defined by their degree sequence, and contributes 
to the body of works extending classical results to hyper¬ 
graphs, such as [7] for the phase transition. Two sequences 
of random hyper-multigraphs are contiguous if they share 
the same limit theory, i.e. first-order formulas are either 
almost surely true on both sequences or neither, allowing 
to compare random sequences and models from a logical 
point of view. The possible limit theories are characterised 
and axiomatised, and from this the contiguity of random 
hyper-multigraphs is deduced (Theorem (21) . 

In the second section, the logical framework and nota¬ 
tions are introduced. In the third section, the considered 
class of random hyper-multigraphs is defined, and the 
convergence law is proved for this class. In the fourth 
section, the limit theories are investigated and a criterion 
for the contiguity of random sequences is deduced; then 
some distributions are compared to other models, such as 
Erdos-Renyi graphs. In the fifth section, some properties 
of interest are studied, and some remarks on future work 
conclude. 


II. Preliminaries 

In this section, the most general objects are defined first 
and more classical structures such as simple graphs are 
defined as specific cases. This is consistent with the logical 
model since graphs can be defined by conditioning on the 
validity of a first-order property. Some definitions may be 
non-standard to fit the context of this paper. 

Let (n)k denote the /c-falling factorial of n, (n)k = 
Ili=o ( n — *)• If S is a (countable) set, let p(S) denote 
the set of probability distributions on S, S be an element 
of p(<S), and s ~ S be a random variable sampled from S. 

A. Hyper-multigraphs and the incidence graph 

A (non-uniform, undirected) hyper-multigraph H = 
(V = {1 , ...,n},M : V(V) 4 N) is a set of vertices V 
equipped with a set of hyper-edges given with multiplici¬ 
ties. For a set e £ V{V), e £ E denotes that M(e) > 0, 
and we suppose that any e with M(e) > 0 is of size at least 
2. A hyper-multigraph is a hyper-graph if all multiplicities 
are 1, in which case the hyper-graph can be considered as 
a structure H = ({1,..., n}, E). A hyper-multigraph is k- 
uniform if every hyper-edge is of size k, and is a multigraph 
if it is 2-uniform. A graph is a 2-uniform hyper-graph. 
Let the classes of all hyper-multigraphs, hyper-graphs, 
multigraphs, and graphs be denoted respectively by Mpj, 
H, Gn, and G. Random hyper-multigraphs (elements of 
p(Hp*)) are denoted by H, and random hyper-multigraph 
sequences by (%„)„<= n- 

Two vertices v and v' are adjacent , denoted by r c o', 
if there is an e £ E such that v £ E and v' £ E. The 
converse notion, that there is no e £ E such that v £ e 
and v' £ e, is denoted by v x v'. The degree of a vertex 
v, denoted deg(u) is the number of edges containing v, i.e. 
|{e | e £ E Av £ e}\. These two notions can be specialised 
to (m, /c)-adjacency by specifying respectively the number 
to of hyper-edges, and the size k of the edges, i.e. v o™ v' 
if there are to hyper-edges of size k containing v and v'. 

The following definitions can be extended to hyper¬ 
multigraphs, but will only be used on simple graphs, 
usually denoted by G. The excess exc(G) of a graph G 
is |£j — \V\. A homomorphism of a graph g = (V g ,E) to G 
is an injection er : V g i-A V such that for all (v\,V 2 ) £ V 2 , 
v\ o i >2 implies a(v i) o cr(i> 2 ). A graph H is a subgraph 
of G if there is a homomorphism from Vh to a subset 
of Vg, and this subset is called a realisation of H. A 
subgraph is (asymptotically) realisable in a random graph 
Q if it is (asymptotically) realised probability bounded 
away from 0. A homomorphism is an isomorphism if it is 
bijective and (vi,V 2 ) £ V 2 , v\ o V 2 <t=> <j{v\) o er(i> 2 ). 
A rooted graph is a graph with a distinguished vertex 
(the root), and a rooted graph isomorphism is a graph 
isomorphism that maps the roots together. A graph G 
induces a distance between vertices defined as dc{x,y) = 
min{t | 3xi,..., x *+1 such that xq o • • • o Xt +1 A Xi = 
x A Xt+i = y} (by convention if this number is not defined 
it is considered to be infinite). A ball B(r , l) = {Vs,Es,r) 



(a) A small hyper-multigraph 
with a multiple edge and a 



(b) e-edges are implicitly oriented 


Fig. 1: A hyper-multigraph and its incidence graph 


of root r and radius l is the subgraph of G defined by 
Vb = {v £ Vg I d(v,r ) ^ 1} and Eb is the restriction 
of Eg to pairs of vertices in Vb- A neighbourhood of a 
graph denotes a ball of finite radius, and its type denotes 
its isomorphism class. 

A bipartite graph is a graph with two sets of vertices 
V and W, such that for all (v,v') £ V 2 (respectively 
IT 2 ), v x v'. Classes of bipartite hypergraphs, etc., can 
be defined, but in this paper only bipartite simple graphs 
are used as in the next definition: 

Definition 1. The incidence graph Iq of a hyper- 
multigraph G = (V, M) is a bipartite graph (V,Ei,£i) with 
Ei = {(e,i) | i ^ M(e)} such that v £ e ■£>■ v £1 ( e,i ). 
Each hyper-edge is represented by a node in Ei, and £- 
edges link vertices to the hyper-edges they belong to. 

Definition 2. Let I denote the class of incidence graphs, 
that is the set of bipartite graphs with an ordered pair of 
sets of vertices where every vertex in Ej has degree at least 

2 . 

The incidence graph of two hyper-multigraphs are iso¬ 
morphic if and only if the two hyper-multigraphs are 
isomorphic, therefore: 

Proposition 1. The class of hyper-multigraphs Hp} and 
the class of incidence graphs I are in bijection. 

This last property allows to use the incidence graph as 
a faithful representation of hyper-multigraphs. 

B. The bipartite configuration model 

The bipartite configuration model B(d v ,d e ) is a way 
to generate random hyper-multigraphs by specifying their 
corresponding incidence graphs. To generate an incidence 




(a) The lines symbolise the matching of the nodes 



(b) Nodes are collapsed by equivalence classes 


Fig. 2: A configuration and corresponding incidence graph 


graph on n vertices, this model takes two degree sequences, 
d v = (dg, d (,...) and d e = (dg, df,...such that d v (k) 
(respectively d e (k)) is the number of vertices in V (re¬ 
spectively in E) of degree k, with S = Y d v = Y d e . A 
configuration C = (f v : Cy <-P- V, f e : Ce '-t E, a) is then 
given by two sets of nodes Cy = Ce = {1,..., S }, two 
partitions according to d v , d e , and a random permutation 
<7 of {1,..., S} (equivalently, a random matching of Cy to 
Ce)- The incidence graph is then obtained by collapsing 
nodes in Cy into a vertex of degree k according to f v and 
collapsing nodes of Ce according to f e , and setting v £ e if 
and only if there exist c„, c e such that f(c v ) = v, f(c e ) = e, 
and cr(c v ) = c e . This process may not define an incidence 
graph since there may be multiple edges between pairs 
of vertices in V and E - in this case, it does not define 
a proper hyper-multigraph since hyper-edges are sets of 
vertices. However, with some conditions on the sequence 
of degree sequences (that the sequences considered here 
satisfy), it generates an incidence graph with positive 
probability [2], and the random structure obtained by 
conditioning on the absence of multiple £-edges is the 
uniform hyper-multigraph with degree sequences (d v , d e ). 
Figure[2]show a configuration and the corresponding graph 
with d e ( 2) = 1, d e ( 3) = 1, cf'(l) = 1, d v { 2) = 2 and 
all other degrees and edge sizes set to 0. Let V denote a 
sequence of degree sequences, i.e. V = (d\, d%, ■ ■ .) where 
each d n is a sequence of natural numbers (therefore d n (k ) 
is the number of vertices of degree k in the n -th graph). 
Let V v (resp. V e ) be the sequence (pi , ,...) with 

K( fc ) = JV\ d n( k ) (resp. Pn( k ) = p| d e n (k)). Then for each 
pair ( D v = (d\, d^, ■ . ■), T> e = (d \, d|,...)) of sequences, 
B(V V ,T> e ) denotes the random incidence graph sequence 
on the corresponding degree sequences. By definition, inci¬ 
dence graphs have no edge of size smaller than 2, therefore 
it is assumed that d®(0) = d®(l) = 0 for all n G N. 
We consider pairs of sequences of sequences that converge 
weakly to a pair of asymptotic degree distributions, denoted 


(d v , d e ), i.e. V v d v and V e -w d e . See [T] for details on 
the weak convergence of distributions, it implies that for 
all k 

lim p v n (k) = d v (k). 

n—>-oo 

Note that there may be no graph that realises a given de¬ 
gree sequence; however with some conditions on the limit 
distributions (that distributions considered here satisfy), 
asymptotically there is always a graph realising the degree 
sequence j4]. Another coherence condition is assumed, 
that the sums ]T) fc kd^(k) and Y^k mus t be equal; 

however the number of edges is adjusted depending on the 
number of vertices, so the average of d v may be different 
from the average of d e . 

C. Logic 

The language of first-order (FO) logic on graphs is 
defined as the closure under boolean connectives and quan¬ 
tifiers of equality and a binary predicate. Usually the vari¬ 
ables are interpreted as vertices and the binary predicate is 
interpreted as the adjacency relation c, however doing so 
makes inexpressible any property about edge multiplicities 
or hyper-edges. Instead, having as set of variables the 
union of vertices and edges and interpreting the binary 
predicate as the £ relation on the incidence graph defines 
FO e . Formally, the logical structure is (V U Ej,0. This 
allows to handle hyper-multigraphs without modifying the 
logical language. 

Proposition 2. Any graph property is FO-dejinable if and 
only if it is FO^-definable. 

See [B]. Therefore all results on the FO-logic of graphs 
are true on the FO e -logic of graphs (consequently, FO e is 
denoted FO). 

A sentence is a formula without free variables, and the 
quantifier-depth of a formula is defined as the maximal 
nestedness of a quantifier in the formula. A structure H 
satisfies a formula ip, denoted H |= ip, if <p is true on H. 
For a sentence <p and a sequence of random structures 
CHn)nGN, let fj,(<p,Hn) denote Pr [H n (= p] with H n ~ W. n , 
and let p(p,'H) denote the limit of this sequence (called 
limiting probability) if it exists. A sequence of random 
structures (fH n ) has a convergence law (for FO-logic) if 
every sentence p has a limiting probability p,(p,'H) (it is 
a zero-one law if all limiting probabilities are either 0 or 
1). A sentence is (asymptotically) almost surely true if its 
limiting probability is 1, and an axiom is such a sentence. 
The theory of a set of axioms is the set of all sentences 
implied by the axioms. 

Definition 3. The limit theory (or almost sure theory) 
T-u of a random structure TL is the set of its almost surely 
true sentences. A limit hyper-multigraph is a countable 
hyper-multigraph that is a model of the theory, i.e. it 
satisfies all formulas of the limit theory. 

A limit theory is consistent and so always has a model, 
and unless the theory is complete (which happens only 



















if there is a zero-one law) there are infinitely many non¬ 
isomorphic models. These definitions can naturally be 
specialised to graphs, and bipartite graphs. 

To see the expressive power of FO on hyper-multigraphs, 
some syntactic sugar can be defined. For example, even if 
there is only one domain, vertices can be distinguished 
from edges by a formula, i.e. a; is a vertex if and only 
if v(x) is true, with v(x) = Vy. ->(y £ x). The formula 
characterising edges is naturally the negation of v(x), i.e. 
e(x) = By. (y £ x). This allows to define edge and vertex 
quantifiers as B v x = Bx. v(x) and B E x = Bx. e(x). That u 
and v are adjacent can be expressed by u o v = Be. ( u £ 
e A v £ e), and the size of an edge can be expressed by 
\e\ = 2 = BviV 2 ^u. (ui £ e A V 2 £ e A vi 7 ^ V 2 A ((zt 7 ^ 
vi A 11 / V 2 ) => u e)) where v (£ e denotes ->{v £ e). 
Similarly, note that |e| = k, as well as any O/- (iq, ..., Vk) 
or (to, fc)-adjacency relation, is definable by a first-order 
formula. Furthermore, it is possible to define subclasses of 
hyper-multigraphs in first-order logic: 

Proposition 3. The subclasses Gn, G and H o/Hn are 
FO-definable. 

A hyper-multigraph is a multigraph if every edge is of 
size 2, and a graph if there are no multiple edges. This can 
be expressed by a first-order formula <Fg = (V B e. (|e| = 
2)) A -i( 3 eie 23 uv. (u £ e\ A v £ e\ A u £ e 2 A v £ e 2 A e\ 7 ^ 
62 )) while <E>g n = (V B e. (|e| = 2)). Therefore, for all H £ 
Hn, H |= if and only if H £ G. A hyper-multigraph is a 
hyper-graph (is in H) if there are no multiple hyper-edges, 
which can be expressed by the formula -i( 3 ei 3 e 2 Vu. (ei 7 ^ 
e 2 A (v £ e\ ££ v £ e 2 )). It follows that convergence laws 
can be deduced on subclasses by interdefinability: 


Proposition 4. Let hi be a random hyper-multigraph hav¬ 
ing a convergence law. Then the random graph Q defined as 
the distribution induced by TL on graphs has a convergence 
law, with 




/i(4 ) G,’H) 


III. The random hyper-multigraph with given 

DEGREE SEQUENCES 


In this section, we define some restrictions on the class 
of sequences we consider, which allow us to prove that 
this class satisfies properties that imply a convergence 
law. After the definitions and statements, the proof is 
given first assuming these properties. Then in the fol¬ 
lowing subsection the theorem is proven for trivial pairs 
of distributions, to simplify subsequent arguments. The 
following subsections prove that the properties hold with 
high probability, using a key lemma showing that the 
expected number of realisations of small subgraphs can 
be computed from the configurations. 

To get a convergence law, it is necessary to put some 
restrictions on the sequences considered. Three conditions 
encompassing a large class of degree sequences are given, 
and entail three properties of the corresponding sequences 


of graphs. The main theorem is then that the random 
sequence defined by the three former conditions corre¬ 
spond exactly to the theories satisfying the three latter 
properties. 

The considered sequences of random incidence graphs 
satisfy, : 

(i) V ,v and V' e converges weakly to d v and d e (in p(N)). 

(ii) There exists a N such that Vn > N,Vk £ N, d v (k ) = 
0 implies that d v n (k) = 0 and d e (k) = 0 implies that 
d e n {k) = 0. 

(iii) For (V.d) being (‘ D v ,d v ) or ( T> e ,d e ), 

lim ^ n ^ k 2 = k 2 d(k) < 00 (*) 

n—»oo ' Tl ' 

i= 1 

y fc 4 _ (**) 

i=l n 

Condition (i) signifies that the degree sequences converge 
to probability distributions on N. Condition (ii) signifies 
there is a threshold after which all represented degrees 
and edge sizes have to be in the support of the dis¬ 
tributions. Condition (iii) signifies that the distributions 
have a bounded second moment Q and that their tails 
are not too heavy (j**j), i.e. their fourth moment grows 
slower than n. Note that there are no requirements on the 
pair of distributions, so the distribution d e is completely 
independent of d v . 

Examples of distributions satisfying these properties 
are distributions with finite support (the case of Dirac 
distributions generating hyper-nrultigraphs with uniform 
edge sizes or regular degrees) or power law distributions, 
i.e. d(k) ss c • k~ a with a greater than 3. The power 
laws with exponent smaller than 3 do not have a sublinear 
fourth moment so do not satisfy condition (iii). A sequence 
that does not satisfy (ii) may have 0 vertices of degree k 
for even n and 1 vertex of degree k for odd n, violating 
the possibility of a convergence law since having degree 
k is a first-order property (note that such a sequence can 
still satisfy condition (i) and (iii), and therefore still have 
a local weak limit). 

The considered graph sequences satisfy: 

(I) Every acyclic neighbourhood has either no realisa¬ 
tions or an unbounded number of realisations. 

(II) The number of realisations of every unicyclic neigh¬ 
bourhood converges to a finite value. 

(Ill) Every subgraph with positive excess has asymptot¬ 
ically no realisation. 

Note that these three properties cover all types of sub¬ 
graphs, since a connected graph has excess —1 if and only 
if it is a tree, excess 0 if it is unicyclic, and positive excess 
if it has more than one cycle. 

These properties shed more light on conditions (i)- 
(iii). The conditions (i)-(iii) have two natures, the first 
is to avoid pathological sequences (condition (ii)), and 
the second one is to ensure that properties (I)-(III) hold 
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Fig. 3: Types of 3-spheres 


(condition (iii)). It may be proven that there is a con¬ 
vergence law even when relaxing conditions of the second 
sort, but in that case the limit theory will be qualitatively 
different. Suppose that for all n > N, the sequence has 
to vertices of degree k for some IV, m £ N (therefore 
k supp(cT)). Then condition (I) is violated, as there 
are exactly m neighbourhoods with a vertex of degree k 
(and with high probability these neighbourhoods are all 
trees). For condition (iii), note that when the tails of the 
distributions get heavier, it means that there are more and 
more vertices of high degrees, and at some threshold these 
vertices account for such an important proportion of edges 
that they get connected to each others, realising subgraphs 
of positive excess, violating (III). 

Lemma 1. Let B('D v ,'D e ) be an incidence graph sequence 
satisfying (i)-(iii). Then it satisfies (I)-(III). 

This allows to state the main theorem: 

Theorem 1. Let B(T> V ,D e ) be an incidence graph se¬ 
quence satisfying (i)-(iii). It defines a sequence of random 
hyper-multigraphs that has a convergence law. 

In the following subsection, the proof is given assuming 
that Lemma [T| holds, as the truth of first-order formu¬ 
las is determined from neighbourhoods alone. Then the 
theorem is proven for the special cases of trivial pairs of 
distributions, allowing to simplify subsequent arguments. 
The next subsections prove that properties (I)-(III) hold, 
thereby proving Lemma [TJ 

A. Proof of the convergence law 

To prove the convergence law, first the notion of q- 
sphere is defined inductively. It is a finite subclass of 
the class of balls of radius q — 1 introduced to capture 
the expressive power of first-order logic. There is only 
one 1-sphere, where the root has no neighbours. Then 
a (g + l)-sphere is a root with at most q neighbours of 
each type of g-sphere. By definition there is only a finite 
number of types of g-spheres. See Figure [3] where black 
vertices mean that the vertex has no other neighbour. 
Intuitively, each quantifier allows to go further, and The 
neighbourhood of a vertex v can be mapped to a g-sphere 
by setting (inductively) to q the number of neighbours 
of v that are of some type of (g — l)-sphere if there are 
more than g such neighbours. Hanf locality lemma (see 
the standard textbook m states that two structures Gi, 
G 2 are equivalent up to g-quantifiers sentences if for every 
type T of (3 9 )-sphere, both G\ and G 2 have more than g 
neighbourhoods of type T or both have the same number 
of neighbourhoods of type T. By properties (I)-(III), for 


any q the number of neighbourhoods of each type is either 
unbounded, or converges to a finite value. It follows that 
every first-order sentence has a limiting probability. 

B. Trivial pairs of distributions 

To simplify subsequent arguments, in the other sub¬ 
sections it is assumed that supp (eT) is different from 
{0}, {1}, and {0,1}, i.e a pair of distribution is trivial 
if supp(eT) C {0,1}. Otherwise it would be necessary to 
add particular cases to every statement, so these pairs are 
treated as special cases here. Note that if this is the case, 
there is a convergence law (even a zero-one law), albeit 
a fairly uninteresting one; in particular, the number of 
unicyclic neighbourhoods is zero for every such neighbour¬ 
hood, while for non-trivial distributions it converges to a 
non-trivial value. If supp(eT) = {0}, then the set of almost 
sure formulas are simply formulas stating that there exist 
vertices, and formulas about non-adjacency. The first set 
gives an axiom scheme, and the second set follows from 
the axiom -<(3e3v. v £ e). There is, up to isomorphism, 
only one countable graph that models these axioms (the 
countable empty graph). Now if 1 £ supp(ef’), there are 
hyper-edges in the graph, but they are all isolated, i.e. 
->(3v3eie2. v £ e\ A v £ e2 A e\ 62). There is an axiom 
scheme for every edge size k in the support of d e , stating 
that there are at least to such hyper-edges. There is, up to 
isomorphism, only one countable graph that models these 
axioms, a countable collection of isolated hyper-edges of 
size k for each k G supp(d e ), along with a countable 
collection of isolated vertices if 0 G supp(ef'). Note that 
these arguments hold for every possible d e . From model- 
theoretical arguments, the uniqueness of the countable 
model of the limit theory implies a zero-one law, and 
therefore convergence. 

G. Proof of property (I): the local weak limit 

The property (I) is proved using the local weak limit; the 
idea is to look at the ‘typical’ neighbourhood of a vertex. 
With high probability this neighbourhood is a tree, and 
the tree observed by picking a vertex uniformly at random 
converges to a simple random object, a branching process. 
This branching process gives that any tree that can be 
realised is realised infinitely often. 

A graph G induces a probability distribution pc on 
rooted graphs by taking a uniform root among all vertices, 

^ G = 

1 1 vev 

where S is the Dirac distribution over the isomorphism 
class of the rooted graph (G, v). A sequence of random 
graphs ( Q n ) has weak local limit p if pg n p, i.e. the 
sequence pg n converges weakly to p (for details on weak 
convergence in metric spaces, see U).A branching process 
is a random rooted tree, defined by a root (generation 0), 
with each vertex in generation i having a random number 
of offspring vertices in generation i + 1. Let BGW(d v , d e ) 


Root 

• 

V 

Generation 1 

/ 

• 

\ 

• 

E 

Generation 2 

/I \ 

• • • 

• 

V 


Fig. 4: First generations of a bipartite branching process 


be a two-sorted Galton-Watson branching process, where 
each point has an independent number of offsprings. It is 
defined inductively by a root of type V (respectively E ), 
with x ~ d v offsprings of type E (respectively x ~ d e 
offsprings of type V). Then a point in generation i > 0 
has x ~ d* offsprings, with d being d v if the point is of 
type V and d e if it is of type E, with 


d*(x) 


(x + 1 )d(x + 1 ) 

E y yd{y) 


(this accounts for the edge already present between the 
point and its parent). An example is given in Figure 2J 


Lemma 2. If B(V V ,V e ) satisfies (i)-(iii), then pg 
BGW(d v ,d e ). 


The proof is based on the exploration of the connected 
component containing the root, and can be adapted from 
the standard proof of convergence for the configuration 
model (see m for details). It allows to deduce property 
(I) from the following consequence: 

Corollary 1. Let B(T> V ,T> e ) satisfy (i)-(iii) and T be a 
bipartite tree, the following are equivalent: 

• T is asymptotically realised infinitely often. 

• T is realisable. 

• Vi> £ Vt (resp. Ve £ Et), deg(u) £ supp(eT) (resp. 
deg(e) £ supp(ef )j. 


Lemma 3. Let H be a (connected) subgraph with k v 
vertices, k e edges, m G-edges, c v its number of vertex- 
automorphisms, c e its number of edge-automorphisms. Let 
d v ~ d v , d e ~ d e be random variables, and recall that 
exc (H) = to — (k v + k e ). Then the expected number of 
realisations of H is 

-exc(flj n^[(^ L )deg H (ui)] ' II ^[(^ e )deg H (e,)] 

'' c v c e (Ed v ) m - k ‘(Ed e ) k ‘ 

Here a vertex (resp. edge) automorphism is an auto¬ 
morphism that maps vertices to vertices (resp. edges to 
edges). Note that this is the expectations of the number 
of embeddings, so if H has degrees that are not in the 
support of the distribution but are smaller than degrees 
in the support, this number will be positive. Contrarily, if 
H contains some degree l bigger than any degree in the 
support, then the expectation of the /-falling factorial is 0 
hence the whole expression goes to 0 . 

This formula is proven directly from the configuration. 
The expected number of realisations of H can be written as 
T7- E r E[p(r(F I), G)], where the sum is over all injective 
maps from Vh to Vq and Eh to Eg, and p is the number 
of homomorphisms for a given map, i.e. p is positive if r is 
a homomorphism and 0 otherwise. Recalling that S is the 
number of £-edges, there are (n)k v ■ ( 5 ^p)k e such maps, as 
there are n = vertex equivalence classes, and edge 
equivalence classes. The division by the number of auto¬ 
morphisms accounts for the repetitions of maps. The first 
£-edge of the homomorphism between vertex V\ and edge 
e\ is matched with probability deg(r(i;i))-deg(r(ei))/5, as 
there are deg(r(ui)), deg(r(ei)) nodes in the configuration 
that can be chosen to create the £-edge between v\ and e±, 
and it is chosen among S nodes. For the following edges, 
the number of nodes corresponding to vertices that have 
already been matched is correspondingly diminished, and 
similarly the number of nodes becomes (S — i) if i £-edges 
have already been matched. Define the product 


This corollary follows directly from the previous lemma, 
since the probability that B(T> V , V e ) is isomorphic to a tree 
T is given by BGW(T). If this probability is positive, then 
there is a linear number of vertices in B with neighbour¬ 
hood isomorphic to T. Since BGW(T ) > 0 if only if all 
of the degrees appearing in T are in the support of the 
distributions (d v ,d e ), property (I) follows. 

D. Subgraph counting lemma 

This subsection gives a formula for the expected number 
of small subgraphs and proves it. It proves the convergence 
of the expected number of realisations of subgraphs with 
non-negative excess, proving the remaining two properties 
in the next subsection. 

Recall that (n)*, denotes the fc-falling factorial of n. The 
expected number of realisations of a bipartite graph with 
non-negative excess can be computed from the following 
formula: 


k v 

p v= 

i=l 

and similarly Pe where the sum is over all edges. Now, a 
uniformly sampled set of k vertices has the same distri¬ 
bution as d® k , the fc-th product of d (to see this, the k 
vertices are distinct with probability ( n)k/n k which goes 
to 1 as n goes to infinity). Therefore 

k k 

2—1 2=1 


allowing to rewrite Py as Pfi = nf=i(^ t, )de gjT (t) 4 ) ( an d 
similarly Pe as P ' E ). Now we have lim n _ > 00 (5 1 — k)/n = 
for any k £ N, and therefore (S)k ~ n k (Ed v ) k . Putting all 
this together, we have 
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n ky + ke (lE(T')fce 

c v c e (S) m (Ed-) kP v E> 







Fig. 5: Bipartite graphs of excess 1 


By rewriting (S) m by n m (Ed v ) m , we obtain a factor 
n k v +k e -m , j e n -exc(if) _ Simplifying the expression by this 
rewriting yields the expected formula. 

E. Proof of properties (II) and (III): subgraphs of non¬ 
negative excess 

Now we prove properties (II) and (III). To prove prop¬ 
erty (HI), note that any graph with positive excess con¬ 
tains a subgraph of excess 1 which can be of three different 
forms, either it is constituted of two cycles sharing a vertex 
and its maximal degree is 4, or it is constituted of two 
cycles linked by a path, or a cycle with a chord, and in 
both the maximal degree is 3 (see Figure [5]). To deduce 
property (III), it is sufficient to see that graphs with excess 
1 are not realisable: 

Corollary 2. Let B(T> V ,T> e ) be a random incidence graph 
sequence satisfying (i)-(iii) and H be a bipartite graph with 
exc (H) = 1. Then H is not realisable. 

First note that by condition (iii), the average degree 
converges to a finite value, so it suffices to show that for 
a graph with positive excess, the value given by Lemma [3] 
goes to zero. Since this value is multiplied by n -exc ( ff - ) = 
n , it is sufficient to show that the numerator is in o(n), 
which is exactly the second part of condition (iii) since 
the maximal degree of a subgraph of excess 1 is either 
3 or 4. Then any graph with positive excess contains an 
embedded subgraph of excess 1 which is not realisable, so 
it is not realisable itself and property (III) follows. 

When applied to unicyclic graphs (graphs with excess 
0), the formula given by Lemma [3] gives property (II): 

Corollary 3. Let B(D V ,D e ) be a random incidence graph 
sequence satisfying (i)-(iii) and H be a bipartite graph with 
exc (H) = 0, then the expected number of realisations of H 
converges. 

By the first part of condition (iii), the numerator is 
finite, therefore every term in the expression is finite. 
There is in fact convergence in distribution to Poisson 
random variables of the number of cycles for each size of 
cycles (there is even joint convergence). Details can be 
found in a. M- This concludes the proof of convergence. 

IV. Limit theories and first-order contiguity 

In this section, we investigate the limit theories defined 
by the incidence graphs sequences B(D V , V e ). First, an 
axiomatisation of the limit theories is given, and the 
models of the limit theories are investigated. Then, as 
some sequences have the same limit theories, we look at 


sequences through the lens of first-order contiguity, a tool 
to compare random sequences from the point of view of 
logic. Then classical models of random graphs are analysed 
as special cases of this framework. The completions of the 
limit theories are studied next, and in the last subsection 
the models of the limit theories are investigated. 

A. First-order axioms 

An axiomatisation of the limit theory can be deduced 
from properties (I)-(III) and distributions ( d v 1 d e ). Let 
be the limit theory of B(V V ,T> e ). First, by prop¬ 
erty (II) this axiomatisation contains no sentence about 
unicyclic neighbourhoods, since each may be realised or 
not. Then property (III) gives that every sentence stating 
the absence of neighbourhoods of positive excess is in the 
limit theory. For all (r, s) e N 2 , let Z TtS be the formula 
stating ‘for every set of r vertices and s edges, there are 
not r+s+1 G-edges’, and Z be the axiom scheme of all such 
sentences. Then, for all k ^ supp(eT) (resp. k ^ supp(eT)), 
let Yjf (resp. Yf) be the formula stating ‘there is no vertex 
of degree exactly k\ and Y denote the axiom scheme of 
all Y v and Y e sentences. Note that this axiom scheme can 
be replaced by a single sentence if both supports of the 
distributions are finite or cofinite. Then, for all bipartite 
trees T, and all me N, let X ^ be the formula stating 
‘there are at least m exact realisations of T”, and T denote 
the axiom scheme of all such sentences. Here, an exact 
realisation means that for all internal nodes in T, the 
formula states that the vertex has exactly the degree it 
has in T. For leaves, the formula states that the vertex 
has degree 1 if and only if the node has no offspring in the 
branching process (since edges must be of size at least 2, 
leaves can only be vertices). Therefore the formula asserts 
the existence of T as induced components if and only if 
the branching process can die out isomorphic to T. The 
three axiom schemes X , Y, Z (with X and Z depending 
on the distributions) axiomatise the almost sure theory of 
B{V V ,V e ), i.e. T x . Y)Z =T d „, d ,. 

B. First-order contiguity 

To compare two sequences of random structures from a 
logical point of view, the notion of first-order contiguity 
is used, as an analogue to the probabilistic notion of 
contiguity of [T5] , 

Definition 4. Two sequences of random structures are 
first-order contiguous if and only if every sentence has 
a limiting probability either on the two sequences or on 
neither of them, and both sequences share the same almost 
sure theory. 

In other words, two sequences of random hyper¬ 
multigraphs are first-order contiguous if and only if for 
every first-order sentence p, ip is either not almost sure 
(in general it may not converge), or it is almost sure on 
the two sequences. 


Theorem 2. Two sequences of hyper-multigraphs de¬ 
fined, respectively by B(/D\,D\) and B(V 2 ,V 2 ) both sat¬ 
isfying (i)-(iii) are first-order contiguous if and only if 
(supp(di),supp(d®)) = (supp(d 2 ),supp(d 2 )). 

Which means that from the point of view of first-order 
logic, the statistical information given by the distributions 
does not matter, only the supports matter. Since there is 
a convergence law, the first property holds obviously, and 
the second property is immediate since the axiomatisation 
only depends on (supp(d u ), supp(d e )). Note that the lim¬ 
iting probability of sentences that are not in the almost 
sure theory may differ. In fact, this is the case exactly if 
the limit distributions differ: 

Corollary 4. There is a formula tp with n(ip, B(T>\,V\)) ^ 
p(q>,B{T>l,D$j) if and only if(d%,c%) ^ {d 2 ,d 2 ). 

Since all sentence probabilities are only dependent on 
the limit distributions, the ‘only if’ part is trivial. To show 
the other direction, suppose the two pairs of distributions 
have the same pair of supports (otherwise the proof is 
trivial), and suppose d\ is different from d 2 on values i and 
j (the case d e being symmetric). A formula that asserts 
the existence of a cycle has a non-trivial probability; 
then specifying the branching of vertices in the cycle to 
have exactly i neighbours gives a formula satisfying the 
requirement. 

As a final remark, the set of limit theories defined by the 
class of degree distributions considered here can be ordered 
by the supports of the corresponding distributions. Indeed, 
the set of realised subgraphs of a sequence B\ (D'{. Vf)) is 
included in the set of subgraphs realised by a sequence 
B2{T> 2 ,V 2 )) if the pair of supports of B\ is included in 
the pair of supports of B 2 , he. (supp(d)’),supp(d®)) C 
(supp^), supp^)). In this sense, there is a maximal 
theory for this class corresponding to the pair of supports 

(N,N). 

C. The case of the (multi)graph with given degree sequence 
and the Erdos-Renyi graphs 

Much work has been done on the configuration model, 
which generates the uniform simple graph on a given 
degree sequence V. This is a specific case that can be 
generated by the bipartite graph B(D,V{ 2 y) where T>{ 2 ] 
contains only edges, converging to <5{ 2 } the Dirac distri¬ 
bution with mass on 2. Note that in that case, the local 
weak convergence is a simple Galton-Watson, and that in 
the Lemma [3] the terms depending on the edges cancel 
out. Precisely, E d e = E[(d e ) 2 ] = 2, so for any realisable 
subgraph, 

IIj=l ^[(^ C )deg H (ej)] __ 

(E d e ) ke 

In particular the expected number of multiple edges is 
therefore A = (E[(d u )2]) 2 /(4(Ed u ) 2 ), and the uniform 
graph with degree sequence d is obtained by conditioning 


on the absence of double edges. Since the number of mul¬ 
tiple edges converges in distribution to a Poisson variable, 
the probability that the graph is simple is the probability 
of 0 in Poi(A), the Poisson distribution of parameter A. 

In the Erdos-Renyi graph Q(n,c/n), each edge is inde¬ 
pendently present with probability c/n. The degree of a 
vertex is therefore given by a binomial of parameter n — 1 
and c/n, i.e. Pr[deg(u) = k] = Q)(^) fc (l - ±) n ~ k . So the 
average degree is c, and the degree distribution converges 
to the Poisson distribution of parameter c. Therefore in 
the limit there are infinitely many vertices of each degree. 
Then Theorem [2] implies directly: 

Corollary 5. Let B(V,V^ 2 y) be a sequence of incidence 
graphs satisfying (i)-(iii) with supp(d) = N. The sequence 
of graphs it defines, conditioned on being simple, is con¬ 
tiguous to Q(n,c/n). 

There are no multiple edges in Q(n,c/n) by definition, 
so it is necessary to condition on simplicity. 

D. Completions of the limit theories and limiting probabil¬ 
ities 

The set of possible limit theories share a strong resem¬ 
blance to the theory of the Erdos-Renyi graph Q[n,c/n), 
and they can be completed in a similar fashion. A comple¬ 
tion t is a set of sentences such that T + t is a complete 
theory, i.e. every formula (or its negation) belongs to the 
theory. A convergence law is equivalent to the fact that 
there is a set C of completions of the limit theory, such 
that any two elements of C are mutually inconsistent. A 
g-completion is a sentence that completes the theory only 
up to q quantifiers. In the present case no completion is 
finite, however some finite formulas are g-completions: 

Corollary 6. Any sentence fixing the number of neigh¬ 
bourhoods in each realisable unicyclic type of (3 9 — l)/2- 
sphere is a q-completion of the theory of B(V V ,V e ). 

This observation was made in m for Erdos-Renyi 
graphs and is given here without a proof, as it can be 
adapted from the original paper without much modifica¬ 
tion. The argument follows from properties (I)-(III), the 
only difference is that the realisable types depend on the 
supports of the distributions. 

Other results of the paper can similarly be extended 
to the bipartite configuration model. In particular, since 
formulas with non-trivial limiting values are formulas 
about unicyclic neighbourhoods, it can be shown that 
the limiting values of formulas can be written using ba¬ 
sic arithmetic operations with the values in Poi(A) and 
BGW(d v , d e ), for any A given by the formula of Lemma [5] 
Conversely, any real number that can be expressed in such 
a form is the limit of a first-order formula. The automata- 
theoretic arguments used to show this also imply that 
the asymptotic probability of a formula (or a formula 
with a given asymptotic probability) can be computed in 



polynomial space and that this bound is optimal. Details 
can be found in El- 

E. Models of limit theories 

In this subsection, the countable models of the limit 
theory are investigated. 

By the given axiomatisation, models of the limit theo¬ 
ries are constituted of tree components and independent 
unicyclic components. However, if there are no vertices 
of degree less than 2, then the BGW branching process 
never dies, and there are no small components. Since there 
is a giant component and that with high probability this 
component is unique [20] , the graph is connected with high 
probability. However, the countable graphs that model the 
limit theory are not connected. As observed in m , this is 
a most roundabout way to show that connectivity is not 
a first-order property. 

Consider for example the limit theory of 8 ( 62 , 62 ), the 
2-regular multigraph. The multigraphs it generates are a 
collection of cycles. However, except for a finite number 
of vertices, all vertices belong to cycles of size larger than 
k, for any k. As the length of the cycles grows, formulas 
of first-order logic cannot distinguish long cycles from 
an infinite alternating path of vertices and edges. Since 
they realise the same g-spheres, two countable graphs that 
are collections of infinite paths satisfy the same set of 
first-order formulas, regardless of the number of infinite 
paths in each. Therefore the limit models of 8 ( 62 , 62 ) are 
constituted of a collection of infinite paths and a collection 
of small cycles. 

Now consider the limit theory of 8(63,62), the 3-regular 
multigraph. The multigraphs it generates are connected 
with high probability. However as n grows the small cycles 
grow further apart from each others (since two cycles 
at fixed distance realises a subgraph of positive excess). 
It follows that in the limit the cycles are in isolated 
components, and each vertex in the cycles is the root of two 
infinite trees with 3-regular vertices and 2-regular edges. 
Therefore the limit models of 8(63,62) are constituted of 
a collection of infinite trees with 3-regular vertices and 2- 
regular edges and a collection of unicyclic components. 

Last consider the limit theory of 8 (d v ,d e ) with 1 £ 
supp(eT). It generates hyper-multigraphs with induced 
components of finite size. By the local weak limit, every 
realised induced component is realised linearly often, and 
therefore appears infinitely often in the limit. Then in the 
limit there may be any number of infinite components 
(including none) since, from a first-order point of view, 
infinite components can be simulated by arbitrarily large 
finite components. Therefore we have: 

Proposition 5. Let 8 (d v , d e ) be a sequence satisfying (i)- 
(iii). If 0,1 ^ supp(cf') then all components of the limit 
incidence graph are infinite. Otherwise there may be any 
number of infinite components in the limit. 



Fig. 6: A triple edge or a double 3-hyper-edge 

The idea is the following. To prove the first statement, 
suppose there is a finite component in the limit, when 0 
and 1 ^ supp(cT). Let this finite component be called 
H and have radius l, then for any q greater than l the 
type of the g-sphere corresponding to H is not generated 
by BGW(d v , d e ), contradicting previous observations. For 
the second statement, it is sufficient to show that the 
same types of g-spheres are realised independently of 
the presence of infinite components. Let L\, L 2 be two 
countable models of the limit theory of B(d v , d e ). Suppose 
that H is an infinite tree component in L 2 and that L\ 
does not contain infinite components. For any g, let T be 
a type of g-sphere realised in H. By hypothesis, L 2 satisfy 
property (I) and therefore T is realised infinitely often in 
L±, in components of radius greater than q. 

V. Other properties of the model 

In this section, two properties of interest are studied, 
first the probability that a hyper-multigraph is a hyper¬ 
graph, and second that, unlike classical models of ran¬ 
dom graphs (including the graphs with specified degree 
sequences), the o-graph can be a sparse graph with non¬ 
trivial clustering coefficient. 

A. Probability to be a hyper-graph 

Let k ^ 3. Then the incidence graph of a double k- 
hyper-edge is a subgraph with k vertices, 2 hyper-edges, 
and 2k £-edges. Therefore the excess of this subgraph is 
2k — (2 + k) >0 (see Figure ©• Similarly, the incidence 
graph of a fc-multiple edge is a subgraph with 2 vertices, k 
edges, and 2k £-edges, with excess 2k — (2 + k) > 0. Since 
subgraphs with positive excess are not realisable, a hyper¬ 
multigraph is a hyper-graph if and only if there are no 
double edges. Therefore the property to have no multiple 
hyper-edge can be computed, as it is just the probability 
to have no 2-cycle. 

Using Lemma [5] to get 

_ (E[K') 2 ]) 2 ■ (E[(d e ) 2 ]) 2 
4(E<f’) 2 (Ecf) 2 

the probability of this property is then the probability of 
0 in Poi(A). 

B. Clustering coefficient 

In this subsection, we consider the o-graph induced by 
the incidence graph, and show how its clustering coefficient 
is affected by the choice of the distributions. Recall that 
x a y if they belong to a common hyper-edge of any 
size, so the o-graph is obtained by replacing every k- 
hyper-edge by a complete graph (a clique) on k vertices. 
The clustering coefficient is an important property in the 



study of social networks, as it measures how much ‘friends 
of my friends are friends’. There are different definitions 
depending on the context, the one considered here is the 
local clustering coefficient, C(v) = M'(v)/M(v), where 
M(v ) is the number of pairs of neighbour vertices of v 
(vertices at distance 2 in the incidence graph) and M’(v) is 
the number of connected pairs of neighbour vertices (pairs 
Vi, V 2 with v\ c u 2 ). If B(d v ,d e ) is a graph (d e = <5 2 ), 
then as observed earlier the number of o-triangles in the 
graph is given by Lemma [3] and so is in 0(1). Therefore in 
that case the average clustering coefficient goes to 0, and 
is 0 for almost all vertices. When there are fc-hyper-edges 
with k > 2, the clustering coefficient becomes 


C v 




't 1 ) 



with M = |e| — 1. 

v£e 


The average clustering coefficient is therefore given by 
C = with d v ~ d v and d e ~ d e . This 

formula shows that by choosing the correct distributions, 
it is possible to generate sparse random graphs with a 
specified clustering coefficient. 


VI. Conclusion and open questions 

The logical framework used here allows to extend several 
classical results on graphs to the very general structure 
of random hyper-multigraphs, solving the problem stated 
back in HI]: “Some more interesting problems would be 
to prove limit laws for structures with relations of degree 
greater than 2. Our techniques rely heavily on graph- 
theoretic concepts, and it is not obvious how to extend 
them to relations of higher degree” (edited). The solution 
is simple, and relies on no more than graph-theoretic 
concepts. 

The specific class considered here encompasses a wide 
range of possible hyper-multigraphs, but a wide range of 
distributions remains to be studied. As remarked in [2], 
the bipartite configuration model allows to use a larger 
class of distributions than the simple configuration model, 
since the maximal degree can be as big as n — o(n ), 
whereas for the simple configuration model the maximal 
degree must be in 0 (y/n) (otherwise the probability that 
a random configuration is a graph goes to zero). However 
for such distributions the methods used here (in particular 
the local weak convergence) cannot be used. When (**) 
does not hold, as the distributions get heavier tails, sub¬ 
graphs with positive excess appear, and when (*) does not 
hold the expected number of cycles becomes unbounded. 
When considering such distributions, there will be non¬ 
contiguous sequences defined by distributions with the 
same supports, and contiguity will also depend on the tail 
of the distributions. 

In another direction, only sparse graphs are considered 
here, but other regimes of interest could be studied, such 
as sublinear or dense regimes of bipartite graphs. Finally, 
other models of hyper-graphs can be considered in this 
setting, such as the model considered in [3], or models 


of multigraphs. Some models in the literature may have 

natural definitions in terms of incidence graphs, while 

some may be contrived. 
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